IDIAP Martigny - Valais - Suisse Continuous Audio � Visual Speech Recognition
نویسنده
چکیده
We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audiovisual speech recognition applications. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We t a c kle the problem of joint temporal modelling of the acoustic and visual speech signals by applying Multi-Stream hidden Markov models. This approach allows the use of diierent temporal topologies and levels of stream integration and hence enables to model temporal dependencies more accurately. T h e system has been evaluated for a continuously spoken digit recognition task of 37 subjects.
منابع مشابه
IDIAP Martigny - Valais - Suisse Fast Object Detection using MLP and FFT
We propose a new technique that speeds up signi cantly the time needed by a trained network MLP in our case to detect a face in a large image We reformulate neural activities in the hidden layer of the MLP in terms of lter convolution enabling the use of Fourier transform for an e cient computation of the neural activities A formal proof and a complexity analysis are presented Finally some exam...
متن کاملIDIAP Martigny - Valais - Suisse Multi � Modal Data Fusion for Person Authentication
In the context of multi modal person authentication a set of experts face recognizer speaker recognizer etc give their opinion about the identity of an individual The opinions of the experts can be combined to form a nal decision rejecting or accepting the claim We show that the nal decision is a binary classi cation problem and propose to solve it by a Support Vector Machine SVM We compare our...
متن کاملIDIAP Martigny - Valais - Suisse Optimal Parameterization of Point Distribution Models Georg Thimm Juergen
We address the problem of determining the optimal model complexity for shape modeling This complexity is a compromise between model speci city and generality We show that the error of a model can be split into two components the model error and the tting error of which the rst one can be used to optimize the model complexity based on the speci c application This strategy improves over tradition...
متن کاملMartigny - Valais - Suisse Illumination � robust Pattern Matching using Distorted Histograms Georg
It is argued that global illumination should be modeled separately from other incidents that change the appearance of objects The e ects of intensity variations of the global illumination are discussed and constraints deduced that restrict the shape of a function that maps the histogram of a template to the histogram of an image location This approach is illustrated for simple pattern matching ...
متن کامل